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Abstract 

Introduction: Determining bacterial community structure in fecal samples through DNA sequencing is an important facet of 
intestinal health research. The impact of different commercially available DNA extraction kits upon bacterial community 
structures has received relatively little attention. The aim of this study was to analyze bacterial communities in volunteer 
and inflammatory bowel disease (IBD) patient fecal samples extracted using widely used DNA extraction kits in established 
gastrointestinal research laboratories. 

Methods: Fecal samples from two healthy volunteers (H3 and H4) and two relapsing IBD patients (II and 12) were 
investigated. DNA extraction was undertaken using MoBio Powersoil and MP Biomedicals FastDNA SPIN Kit for Soil DNA 
extraction kits. PCR amplification for pyrosequencing of bacterial 16S rRNA genes was performed in both laboratories on all 
samples. Hierarchical clustering of sequencing data was done using the Yue and Clayton similarity coefficient. 

Results: DNA extracted using the FastDNA kit and the MoBio kit gave median DNA concentrations of 475 (interquartile 
range 228-561) and 22 (IQR 9-36) ng/ul respectively (p<0.0001). Hierarchical clustering of sequence data by Yue and 
Clayton coefficient revealed four clusters. Samples from individuals H3 and 12 clustered by patient; however, samples from 
patient II extracted with the MoBio kit clustered with samples from patient H4 rather than the other II samples. Linear 
modelling on relative abundance of common bacterial families revealed significant differences between kits; samples 
extracted with MoBio Powersoil showed significantly increased Bacteroidaceae, Ruminococcaceae and Porphyromonadaceae, 
and lower Enterobacteriaceae, Lachnospiraceae, Clostridiaceae, and Erysipelotrichaceae (p<0.05). 

Conclusion: This study demonstrates significant differences in DNA yield and bacterial DNA composition when comparing 
DNA extracted from the same fecal sample with different extraction kits. This highlights the importance of ensuring that 
samples in a study are prepared with the same method, and the need for caution when cross-comparing studies that use 
different methods. 
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Introduction 

The last decade has seen a marked rise in interest in the 
bacterial communities that coexist within humans, facilitated by 
the availability of modern molecular techniques. The Human 
Microbiome Project[l] and MetaHIT[2] have made considerable 
progress in furthering our understanding of microbial diversity and 



community structure in different body areas of healthy individuals. 
The gastrointestinal tract is the most heavily colonized organ in 
the body, with 70% of bacteria found in humans residing in the 
colon [3-5]. Differences in the diversity and community structure 
of the gut microbiota have been associated with diseases of the 
gastrointestinal tract such as inflammatory bowel disease 
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(IBD)[6,7] and irritable bowel syndrome [8] , as well as metabolic 
disorders like type 2 diabetes mellitus and obesity[9]. 

Determining the bacterial community structure in fecal samples 
through amplification and sequence analysis of extracted DNA has 
revolutionized gastrointestinal microbiology research over recent 
years. These culture-independent techniques for assessing diversity 
have largely replaced traditional culture based approaches as they 
are considered to be less biased in terms of defining true diversity 
and considerably less labor-intensive [10,1 1]. Due to the recent 
rapid increase in DNA-based phylogenetics of bacterial commu- 
nities many different DNA extraction procedures are used, each 
with its own potential biases. All methods rely on chemical or 
mechanical disruption, lysis using detergents, or a combination of 
these approaches. 

Previous studies have evaluated differences between DNA 
extraction methods from fecal samples, exploring detection with 
conventional PCR[12,13], quantitative PCR[14,15], bands on 
denaturing gradient gel electrophoresis (DGGE) [15-20] and 
phylogenetic microarray[21]. Significant differences in relative 
abundance have been demonstrated when DNA was extracted 
using different methods from mock communities of bacteria and 
assessed by 16S rRNA sequencing[22,23]. Wu et al. described the 
effect of different fecal extraction methods on 16S rRNA 
pyrosequencing, comparing QIAamp DNA Stool Minikit, MoBio 
PowerSoil DNA Isolation Kit and Stratec PSP Spin Stool DNA 
Kit[24]. 

The aim of this study was to analyze bacterial communities in 
healthy volunteer and IBD patient fecal samples extracted using 
the MoBio and FastDNA DNA extraction kits in two established 
gastrointestinal research laboratories. The MoBio Power Soil 
DNA extraction kit and the MP Biomedicals Fast DNA Spin Kit 
for Soil DNA extraction kit are two commonly used extraction 
procedures for fecal microbial diversity studies [2 5-2 7]. Both 
methods use a combination of mechanical disruption and chemical 
lysis. 

Methods 

Fecal sample collection and initial processing 

Fecal samples were taken from two patients with IBD (1 1 and 12) 
and from two healthy controls (H3 and H4) using the Fisher Fecal 
Commode Collection Kit. Fecal samples were kept at 4°C and 
processed within 4 hours of collection. This short period of storage 
is not expected to influence molecular estimation of microbial 
community composition [25]. Each sample was thoroughly mixed 
and several aliquots of 500 mg were dispensed. Aliquots were 
distributed between two established microbial research laborato- 
ries (Institute of Medical Sciences (IMS) and The Rowett Institute 
of Nutrition and Health (RINH), both Aberdeen University) and 
then subject to further processing as detailed in Figure 1 and 
described below. 

Ethics Statement 

Ethical approval was granted by North of Scotland Research 
Ethics Service (03/0137 and 12/NS/0061) on behalf of all 
participating centers and written informed consent was obtained 
from all subjects. 

MoBio PowerSoil DNA extraction procedure 

One 500 mg fecal aliquot was used for MoBio PowerSoil DNA 
isolation kit extraction. 5 ml of MoBio lysis buffer was immediately 
added to the fresh fecal sample, which was then vortex mixed for 
30-40 seconds. Fecal suspensions were then centrifuged 
(1,500 gx5 minutes) and 1 ml of the supernatant placed into the 



MoBio Garnet bead tubes containing 750 ul of MoBio buffer. 
These tubes were then heated at 65 °C for 10 minutes, then at 
95°C for 10 minutes. Samples were then stored at — 80°C prior to 
processing in both laboratories following the manufacturer's 
instructions. DNA was eluted in 100 uL MoBio elution buffer. 

FastDNA SPIN Kit for Soil procedure 

For each fecal sample 2x500 mg aliquots were placed in 
FastDNA SPIN Kit lysing matrix E tubes and 978 ju.1 of sodium 
phosphate buffer and 122 MT buffer were added to each tube 
and vortex mixed. One aliquot was then stored at — 80°C and was 
defined as FastDNA method 1 . The second aliquot was subjected 
to additional processing by heating at 65 °C for 10 minutes, then at 
95°C for 10 minutes followed by storage at — 80°C. This was 
defined as FastDNA method 2. Both aliquots were then processed 
following manufacturer's (Qbiogene, MP Biomedicals, Illkirch, 
France) instructions. DNA was eluted in 100 uL FastDNA elution 
buffer. 

PCR amplification 

Fecal DNA was quantified by Nanodrop mass spectrophotom- 
etry before dilution to 25 ng/ul. Initial PCR amplification was 
undertaken at each laboratory with Invitrogen AccuPrime Taq 
DNA Polymerase High Fidelity utilising a per-reaction mix of 2 ul 
of DNA template, 2 ul of Buffer II, 0.2 ul (2 uM) Fusion Primer A, 
0.2 ul (2 uM) Fusion Primer B, 0.08 ul (1 U) Accuprime Taq and 
15.52 (U.1 sterile, deionized water to a final volume of 20 ul. 
Quadruplicate PCR reactions were set up per DNA sample. The 
16S rRNA gene primers, spanning the V3-5 region of the 16S 
rRNA gene, were configured as follows: 338F, 5'- 
CCTATCCCCTGTGTGCCTTGGCAGTCTCAGACTCCTACGG- 
GAGGC AGC AG-3 ' , where the bases in italics are 454 Lib-L kit 
adaptor "B", and 926R, 5'- CCATCT- 
CATCCCTGCGTGTCTCCGACTCAG -Marker- 
CCGTC AATTCMTTTRAGT-3 ' , where the underlined bases 
are 454 adaptor "A" and the marker sequence was a unique 12- 
mer string of error-correcting Golay barcode bases for each 
sample [28]. No barcode was added to the forward primer. Hence 
the PCR products were flanked by a 40 bp fusion primer/ 
multiplex identifier sequence at the reverse end and a 30 bp fusion 
primer at the forward end. PCR cycling conditions were as 
follows: 2 minutes at 94°C; 20 cycles of 30 seconds at 94°C, 30 
seconds at 53°C, 120 seconds at 68°C. Following confirmation of 
adequate and appropriately sized product, the quadruplicate 
reactions were pooled and ethanol precipitated prior to purifica- 
tion as per the recommended AMPure purification method for 
454 sequencing. The PCR products were then sequenced with the 
Roche 454 Titanium sequencing platform using the Lib-L kit 
(Wellcome Trust Sanger Institute, Cambridgeshire, UK). The 
sequence data are available from the European Nucleotide 
Archive under Study Accession Numbers ERP004371 and 
ERP004372, and Sample Accession Numbers ERS373486 and 
ERS373498. The relevant barcode information for each of the 
samples is shown in Table SI. 

Quantitative PCR 

Quantitative real-time PCR was performed as described 
previously [2 9]. Briefly, standard curves consisted of ten-fold 
dilution series of amplified bacterial 16S rRNA genes from 
reference strains. Samples were amplified with universal primers 
against total bacteria and specific primers against Bacteroidaceae, 
Ruminococcaceae, Lachnospiraceae, Enterobacteriaceae (Table 1). The 
abundance of 16S rRNA gene copies was determined from 
standard curves and specific bacterial groups were expressed as a 
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Each sample 
homogenized and 
split between two 
sites - IMS and Rl 



Each site extracted 

DNAfrom each 
sample using three 
methods 



Each DNA extract 
was split between 

the two sites for 
PCR prior to 

pyrosequencing 



4 samples: 
2 healthy 
controls 
2 IBD patients 




Roche 454 Titanium pyrosequencing 



Figure 1. Study protocol. IMS: Institute of Medical Sciences, University of Aberdeen, Foresterhill, Aberdeen; Rl: Rowett Institute of Nutrition and 
Health, University of Aberdeen, Bucksburn, Aberdeen. 
doi:1 0.1 371 /journal.pone.0088982.g001 



percentage of total bacteria determined by universal primers. 5 ng 
of DNA was used per reaction. The same DNA concentration was 
used for all runs, including universal primer runs which were used 
to normalize specific bacterial groups against total bacteria, to 
minimize errors due to any inhibitory substances in the samples. 
The detection limit was determined with negative controls 
containing only herring sperm DNA. 

Bioinformatic and Statistical Analysis 

Analysis of sequence data was carried out using the Mothur 
software package [30] . Initially, the "trim.seqs" function was used 
to filter reads for quality by truncating them once average quality 



scores dropped below 35 across a rolling window of 50 bases. In 
addition all reads with any mismatches to the primer or barcode 
sequences, plus reads with ambiguous bases (i.e. "N"s) or with 
homopolymeric stretches of longer than 8 bases were removed. 
Read length following this step ranged from 336 to 351 bp. 
Chimeras were then checked for and removed using Perseus 
software [31], as implemented in Mothur. The sequences were 
then aligned to the reference SILVA database provided in 
Mothur, a distance matrix generated, and then clustered into 
operational taxonomic units (OTUs) at 97% similarity using the 
average neighbor setting in Mothur. Each OTU was assigned a 
taxonomic classification at all levels from Phylum to Genus using 



Table 1. qPCR primers used. 





Bacterial family 


Primer name 


Primer sequence 


Reference 


All bacteria 


UniF 


GTGSTGCAYGGYYGTCGTCA 


[38] 




UniR 


ACGTCRTCCMCNCCTTCCTC 




Bacteroidaceae 


Bac303F 


GAAGGTCCCCCACATTG 






Bfr-Fmrev 


CGCKACTTGGCTGGTTCAG 




Ruminococcaceae 


Clep866mF 


TTAACACAATAAGTWATCCACCTGG 






Clept1240mR 


ACCTTCCTCCGTTTTGTCAAC 




Lachnospiraceae 


Erec482F 


CGGTACCTGACTAAGAAGC 






Erec870R 


AGTTTYATTCTTGCGAACG 




En terobacteriaceae 


EnterobactDmod2F 


GACCTCGCGAGAGCA 


[29] 




Enter1432mod 


CCTACTTCTTTTGCAACCCA 





doi:1 0.1 371 /journal.pone.0088982.t001 
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the reference Ribosomal Database Project database (RDP) 
provided in Mothur with the Gemmigerl Subdoligranulum classification 
error corrected. Jaccard and Yue and Clayton distance matrices 
were calculated using the vegan package in R[32]. Dendrograms 
were generated using Ward clustering, and then visualized using 
the iTOL web package [33]. 

Comparisons in DNA yield were performed using Mann 
Whitney U testing. Linear modelling was used to assess the 
relative contribution of patient, DNA extraction method and 
extraction site to the measured proportions of different bacterial 
families. Log-transformed data was used to permit analysis of the 
fold change. A model was constructed for each bacterial family 
using the donor source, extraction method and extraction site as 
covariates. Bacterial families were reported where at least one 
sample had an abundance of 5% or more. For each family, 
samples were only included from participants with at least 0.5% 
abundance for that bacterial family in one or more of their 
samples. Modelling was also done in a similar manner using 
individual OTUs. When using linear modelling at the OTU level, 
Holm's method was used to correct for multiple testing. 
Correlation between pyrosequencing and qPCR data was done 
using Pearsons's correlation coefficient. Analysis was performed 
using R 2.15.2 (R Statistical Foundation, Vienna, Austria). 

Results 

DNA yields were significantly higher with either method of the 
FastDNA kit than with the MoBio kit, with median DNA 
concentrations of 476 ng/|jL (interquartile range [IQR] 290- 
519) for FastDNA method 1, 453 ng/uX (IQR 228-689) for 
FastDNA method 2 and 22 ng/uL (IQR 9-36) for the MoBio 
method (p<0.001 for both comparisons, Figure 2). There was no 
significant difference in yield between the two methods of the 
FastDNA kit (p = 0.798). 

Compositional analysis indicated a higher proportion of 
Enterobacteriaceae and Sutterellaceae, and lower Ruminococcaceae, in 
the samples from the two IBD patients compared with the two 
healthy controls, regardless of the extraction method or labora- 
tory. Although this study was clearly not powered to differentiate 



I p<0.001 

g p=0.798 . . p<0.001 



o 



< 




o 



1 1 1 

FastDNAI FastDNA2 MoBio 

Kit 

Figure 2. Comparison of DNA yields between extraction 
methods. 

doi:1 0.1 371 /journal.pone.0088982.g002 



between IBD cases and controls the higher observed proportions 
of Proteobacteria in cases, particularly case 12, is consistent with 
patterns described previously in IBD [6,7]. 

Clustering of the microbiota composition derived from the 
sequence data for these samples was carried out using both the 
Jaccard and the Yue and Clayton calculators. The Jaccard 
calculator is used to describe overlap in community membership 
between different samples and ignores the proportional abundance 
of each OTU while, in contrast, the Yue and Clayton calculator 
takes the proportional abundance of each OTU into account 
when comparing community similarities. Jaccard-based calcula- 
tions revealed a clear clustering of samples primarily by subject of 
origin (Figure 3a). This is as expected given the well-known inter- 
individual variation in microbiota composition between individ- 
uals [34]. Within individuals, however, the MoBio-processed 
samples tended to cluster together, separately to those processed 
using the FastDNA kit, indicating that, although there were overall 
similarities in the range of organisms that were identified using the 
two DNA extraction kits, there is some bias associated with the use 
of each kit. More serious repercussions of using different DNA 
extraction kits were observed when using the Yue and Clayton 
distance metric, where dominant organisms can have more impact 
on clustering patterns. The MoBio-processed samples of subject II 
clustered with the samples from subject H4 rather than with the 
FastDNA-processed samples from patient II, presumably as a 
result of elevated Bacteroides and lower Lachno.spiraceae proportional 
abundances in the MoBio extractions compared to the FastDNA 
extractions (Figure 3b). This demonstrates that biases introduced 
by DNA extraction methodology can over-ride the real underlying 
patterns of community structure driven by inter-individual 
variation. 

Linear modelling of the family level data for the top nine 
families represented in the pyrosequencing data is shown in 
Table 2 (range of abundances in table S2). Significant differences 
were identified between the FastDNA and MoBio kits, with 
relatively higher Bacteroidaceae, Ruminococcaceae and Porphyromonada- 
ceae, and lower Enterobacteriaceae, Lachnospiraceae, Clostridiaceae and 
Erysipelotrichaceae following extraction with the MoBio kit. There 
was a significant difference identified between the two methods of 
the FastDNA kit in just one family, the Rikenellaceae. The extraction 
site made a significant difference only for Sutterellaceae, with the 
observed differences being driven by an increase in one OTU in 
samples from patients H4 and 1 1 when they had been extracted at 
RINH (Using FastDNA methods in patient H4, relative 
abundance 0.24% (95% confidence interval 0.11-0.38%) at IMS 
and 2.64% (0.85-4.44%) at RINH.) The site at which the 
amplification PCR was performed made no significant difference 
for any of the bacterial families analyzed, and was therefore 
excluded from the models. 

At the OTU level, 18 OTUs were significantly different 
between the MoBio kit and the FastDNA kit after correction for 
multiple testing (Table 3). Of these, 10 were from the Lachnospir- 
aceae family and 8 of these 1 0 were relatively under-represented in 
the MoBio processed samples, in some cases with a complete 
absence of the OTU in the MoBio samples. 

Correlation between pyrosequencing and qPCR data was 
generally good (Figure 4), with R" values of 0.81, 0.86 and 0.94 
for Ruminococcaceae, Bacteroidaceae and Enterobacteriaceae respectively. 
However, the correlation was less good for the Lachnospiracaeae, 
with an R z value of 0.42. Linear modelling revealed similar 
differences to that seen in the pyrosequencing data, although the 
differences related to extraction method were only significant for 
Ruminococcaceae and Enterobacteriaceae (Tables 4, S3). 
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Figure 3. Dendrogram of the representation of bacterial families derived from 16S rRNA gene sequences within each sample 
clustered by Jaccard (A) and Yue and Clayton (B) distances. 

doi:1 0.1 371 /journal.pone.0088982.g003 



Discussion 

With the recognition that cultured bacteria cover only a small 
proportion of gut microbial diversity [35], a number of molecular 
techniques have been developed to describe and quantify the gut 
microbiota, from qualitative gel-based methods to full metage- 
nomic sequencing[5,36]. Almost all of these techniques require 
extraction of DNA from fecal or mucosal samples as a first step, 
and differences at this point will influence downstream results. The 
importance of this will be amplified if, for example, cases and 
controls are processed in a different manner. 

This study highlights important differences in the performance 
of two commercially available kits for DNA extraction from fecal 
samples. Significantly lower DNA yields were seen with the MoBio 
kit than the FastDNA kit. This is consistent with results published 
previously [19]. More importantly, there were significant differ- 
ences in the relative abundance of bacteria measured at both the 
family and OTU level. There is no gold standard to which these 
data can be compared, and so it is impossible to say which 
technique yields results closer to the true profile of the samples. 
However, the lower yield of the MoBio kit, and reduced 
proportional abundance of the Lachnospiraceae family of Firmicutes, 
suggests that this kit may not be stringent enough for optimal 
lysing of some Gram-positive organisms. Regardless, these 
differences are such that it is important to stipulate that all 
samples in a particular experiment should be extracted using the 
same technique. This is of particular importance with multicenter 
studies. Moreover, it should prompt researchers to exercise 
caution when comparing datasets from different studies. Indeed, 
if DNA has been extracted using different kits then studies should 
not be considered cross-comparable. Of note, a recent meta- 
analysis found that samples from studies of fecal microbiota within 
Western populations clustered by study, suggesting that systematic 
bias was introduced by factors such as DNA extraction 
technique [37]. 

The importance of the observed differences will depend on the 
analysis techniques used. However, whenever a relative quantifi- 

Table 2. Linear modelling of family-level pyrosequencing data. 



cation technique is used, the results for even a single organism will 
be influenced by the effects of the extraction technique on the total 
number of bacteria isolated relative to that specific species. The 
methods for both kits used here involved physical disruption by 
bead-beating. Methods that rely on enzymatic treatment without 
physical disruption have been shown previously to give biased 
recovery, with reduced recovery of Gram-positive organisms and 
artificially elevated levels of Gram negatives, presumably because 
these are more easily lysed[18,22]. 

A smaller effect was observed of the extraction site on relative 
abundance, with only Sutterellaceae reaching statistical significance. 
This may reflect a difference in operator, equipment or laboratory 
environment. To minimize the influence of differences between 
laboratories, centralization of DNA extraction for an experiment 
would be preferred. The technique described here includes only 
minimal processing after sample collection prior to interim storage 
at — 80°C. This allows for collection sites to collate a number of 
samples in — 80°C storage prior to shipment to a central facility for 
DNA extraction and downstream analysis. 

The qPCR data in general correlated well with that from 
pyrosequencing with the exception of Lachnospiracaeae. This can be 
partially explained by differences between the range of organisms 
that were targeted by the qPCR primers and those that were 
classified as Lachnospiracaeae in the pyrosequencing data, although 
78% of OTUs and 89% of sequences labelled as Lachnospiracaeae 
were estimated in silico to be targeted by the qPCR primer set used. 

Ariefdjohan et al. previously assessed the effect of DNA 
extraction method on the measured bacterial composition of stool 
using denaturing gradient gel electrophoresis (DGGE)[19]. This 
study demonstrated variability in bacterial community between 
fecal samples extracted with QJAamp DNA, MoBio Ultra Clean 
Fecal DNA and FastDNA SPIN kits, and noted that both the 
MoBio and Qiagen kits were not able to extract DNA from all the 
bacteria in the specimen. More recently, Claassen et al. used 
DGGE, terminal restriction fragment length polymorphism (T- 
RFLP) and qPCR to compare fecal samples extracted using kits 
from Qiagen, ZymoResearch and MoBio and found few 





Bacterial Family 


Kit 








Extraction Site 








FastDNA 2 fold 
change 


P 


MoBio fold change 


P 


RINH fold change 


P 


Patients 
included 


Lachnospiraceae 


0.96 (0.74-1.25) 


0.775 


0.63 (0.49-0.81) 


0.001 


1.17 (0.95-1.44) 


0.160 


H3,H4,I1,I2 


Bacteroidaceae 


1.13 (0.79-1.63) 


0.501 


2.13 (1.49-3.05) 


<0.001 


1.09 (0.81-1.46) 


0.561 


H3,H4,I1,I2 


Ruminococcaceae 


0.94 (0.79-1.13) 


0.524 


1.32 (1.11-1.58) 


0.005 


0.95 (0.82-1.10) 


0.516 


H3,H4,I1 


En terobacteriaceae 


1.08 (0.74-1.57) 


0.695 


0.61 (0.43-0.88) 


0.016 


0.85 (0.63-1.15) 


0.311 


11,12 


Sutterellaceae 


0.77 (0.18-3.37) 


0.735 


1.11 (0.26-4.69) 


0.892 


3.84 (1.18-12.46) 


0.031 


H3,H4,I1,I2 


Clostridiaceae 


1 .00 (0.77-1 .30) 


0.976 


0.46 (0.36-0.59) 


<0.001 


0.88 (0.71-1.08) 


0.243 


11,12 


Porphyromonadaceae 


1.46 (0.41-5.19) 


0.560 


4.03 (1.16-14.01) 


0.035 


0.70 (0.26-1 .94) 


0.502 


H3,H4,I1,I2 


Erysipelotrichaceae 


1.21 (0.81-1.81) 


0.361 


0.32 (0.21-0.47) 


<0.001 


0.88 (0.64-1 .22) 


0.445 


H3,H4,I1,I2 


Rikenellaceae 


0.35 (0.16-0.76) 


0.016 


0.72 (0.33-1 .56) 


0.418 


0.65 (0.35-1.19) 


0.181 


H3.H4 



RINH: Rowett Institute of Nutrition and Health. 

Participants were excluded if all data points for that bacterial family were < 0.5%. Reference sample was from participant H3 using FastDNA method 1 and extracted at 
the Institute of Medical Sciences. Differences are shown as fold change with 95% confidence intervals. 
doi:1 0.1 371 /journal.pone.0088982.t002 
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Table 3. Multiple linear modelling after correction for multiple testing shows OTUs with significantly different relative abundance 
after extraction with the MoBio kit. 





Genus 


Family 


Order 


Class 


Phylum 


Fold change 


P 




Corrected p 


Patients 
included 


Eggerthella 


Corio bacteriaceae 


Coriobacteriales 


Actinobacteria 


Actinobacteria 


0.00 


(0.00-0.00) 


5.09x10 


-9 


5.55x10" 


-7 


11 


Blautia 


Lachnospiraceoe 


C lost rid iales 


Clostridia 


Firmicutes 


0.00 


(0.00-0.01) 


1 .70 x 10 


-7 


1.83x10" 


-5 


H3,H4,I1 


Blautia 


Lachnospiraceae 


Clostridiales 


Clostridia 


Firmicutes 


0.01 


(0.00-0.04) 


2.01 xlO 


-7 


2.15x10" 


-5 


H3,H4,I1 


Bacteroides 


Bacteroidaceae 


Bacteroidales 


Bacteroidia 


Bacteroidetes 


2.60 


(1.92-3.52) 


3.26x10 


-7 


3.46x10" 


-5 


H3,H4,I1,I2 


Clostridium sensu Clostridiaceae 
stricto 


Clostridiales 


Clostridia 


Firmicutes 


0.36 


(0.30-0.43) 


3.91 xlO 


-6 


0.0004 




11 


Blautia 


Lachnospiraceae 


C 1 n cti-i H i a 1 £»c 

v_ 1 KJ J 1 1 1 KA 1 a 1 L. > 


C\r\^tjf\r\'\^ 

\_ll_O LI K.I Id 


Firmi/~i i1"f^c 

III 1 1 1 li_U IL J 


0 01 


(n nn n r\^\ 


1 97x10 


-5 


0 0020 




H3 H4 11 


unclassified 


Ruminococcaceae 


f~ 1 n c1"r iH i a 1 £»c 

v_ 1 KJ J 1 1 lUlultTJ 


\_ll_o LI Kl Id 


Firmifi i1"f^c 

III 1 1 1 li_U Icj 


0 17 


in 1 1 n ~)A\ 


3 13x10 


-5 


0 0032 




H3 


unclassified 


Lachnospiraceae 


flAct rffiiAl&c 

v_ 1 KJ J L 1 1 U 1 a 1 L :> 


v_ \kj ^ li iu id 


Firmi/~i i1"f^c 

r 1 1 1 1 1 li_U IL J 


0 00 


(n nn n nil 
lu.uu— U.U I ) 


3 49 x 10 


-5 


0 0036 




H4 11 


Anaerostipes 


Lachnospiraceae 


Clostridiales 


Clostridia 


Firmicutes 


0.10 


(0.04-0.23) 


5.23x10" 


-5 


0.0053 




H3,H4 


Escherichia 
Shigella 


Enterobacteriaceae 


Enterobacteriales 


Gamma - 
proteobacteria 


Proteo bacteria 


0.41 


(0.28-0.59) 


1 .36 x 10 


-4 


0.0136 




11,12 


unclassified 


Lachnospiraceae 


Clostridiales 


Clostridia 


Firmicutes 


5.05 


(2.55-9.97) 


2.24x10 


-4 


0.0222 




H3,I1 


Ruminococcus 


Ruminococcaceae 


Clostridiales 


Clostridia 


Firmicutes 


0.48 


(0.34-0.68) 


0.0002 




0.0232 




H3,H4,I1 


unclassified 


Lachnospiraceae 


Clostridiales 


Clostridia 


Firmicutes 


2.50 


(1.92-3.26) 


0.0003 




0.0250 




H3 


unclassified 


Ruminococcaceae 


Clostridiales 


Clostridia 


Firmicutes 


3.19 


(2.26-4.50) 


0.0003 




0.0298 




H3 


Bacteroides 


Bacteroidaceae 


Bacteroidales 


Bacteroidia 


Bacteroidetes 


2.03 


(1.42-2.89) 


0.0004 




0.0342 




H3,H4,I1,I2 


Dorea 


Lachnospiraceae 


Clostridiales 


Clostridia 


Firmicutes 


0.00 


(0.00-0.06) 


0.0004 




0.0343 




H3,H4,I1 


unclassified 


Lachnospiraceae 


Clostridiales 


Clostridia 


Firmicutes 


0.00 


(0.00-0.00) 


0.0004 




0.0358 




11 


Dorea 


Lachnospiraceae 


Clostridiales 


Clostridia 


Firmicutes 


0.06 


(0.01-0.25) 


0.0004 




0.0381 




H3,H4,I1,I2 



Samples were excluded if all data points for that bacterial family were <0.5%. Reference sample was from patient H3 using either FastDNA method and extracted at the 
Institute of Medical Sciences. Differences are shown as fold change with 95% confidence intervals. 
doi:1 0.1 371 /joumal.pone.0088982.t003 
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Figure 4. Correlation between pyrosequencing and qPCR data. 
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Table 4. Linear modelling of qPCR data. 





Bacterial Family 


Kit 








Extraction Site 








FastDNA 2 fold changep 


MoBio fold change 


P 


RINH fold change 


P 


Patients included 


Lachnospiraceae 


0.75 (0.48-1.17) 


0.209 


0.74 (0.48-1.16) 


0.199 


1.35 (0.94-1.94) 


0.107 


H3,H4,I1,I2 


Bacteroidaceae 


1.19 (0.94-1.51) 


0.147 


1.25 (0.99-1.57) 


0.066 


0.98 (0.81-1.18) 


0.822 


H3,H4,I1,I2 


Ruminococcaceae 


0.69 (0.47-1.01) 


0.070 


2.32 (1.58-3.39) 


< 0.001 


1.26 (0.92-1.71) 


0.157 


H3,H4,I1 


Enterobacteriaceae 


1.28 (0.97-1.69) 


0.102 


0.65 (0.48-0.87) 


0.011 


0.91 (0.72-1.15) 


0.436 


11,12 



RINH: Rowett Institute of Nutrition and Health. 

Participants were excluded if all data points for that bacterial family were <0.5%. Reference sample was from participant H3 using FastDNA method 1 and extracted at 
the Institute of Medical Sciences. Differences are shown as fold change with 95% confidence intervals. 
doi:1 0.1 371 /journal.pone.0088982.t004 



significant differences [20] . In contrast, the previous study by Wu et 
al. which assessed the effect of extraction methods on 16S rRNA 
pyrosequencing demonstrated increased yield of Firmicutes when 
a hot phenol bead-beating method or the PSP kit were used. The 
present study helps bring further clarity to this important issue 
with next generation sequencing permitting a more detailed 
exploration of the differences between samples extracted with 
different methods. 

This study is somewhat limited by its relatively small sample 
size, with fecal samples obtained from only four individuals. There 
were a small number of outliers; samples H4F2AA and H4F2AR 
had much higher relative abundance of Bacteroidaceae. In addition, 
the data obtained here from both pyrosequencing and qPCR 
estimate relative abundance rather than absolute numbers and 
focus on the dominant groups within the microbiota. 

Conclusions 

This study demonstrates important differences in the yield and 
relative abundance of key bacterial families for kits used to isolate 
bacterial DNA from stool. This highlights the importance of 
ensuring that all samples to be analyzed together are prepared 
with the same DNA extraction method, and the need for caution 
when comparing studies that have used different methods. 
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Table SI Barcodes used for each sample included in the study, 
and the respective ENA-deposited dataset that they can be 
recovered from. 
(XLSX) 

Table S2 Relative abundances of the top nine bacterial families 
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(DOCX) 

Table S3 Relative abundances of the bacterial families mea- 
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(DOCX) 

Acknowledgements 

UK IBD Genetics Consortium individual authors and affiliations: Lead = 
Parkes M: Inflammatory Bowel Disease Research Group, Addenbrooke's 
Hospital, University of Cambridge, Cambridge CB2 0Q_Q, UK. 

Other members not named within the manuscript author list 
(alphabetical by surname): 

Ahmad T: Peninsula College of Medicine and Dentistry University of 
Exeter, EX1 2LU, UK. 



Anderson CA: The Wellcome Trust Sanger Institute, Wellcome Trust 
Genome Campus, Hinxton, Cambridge, CB10 ISA, UK. 

Barrett JC: The Wellcome Trust Sanger Institute, Wellcome Trust 
Genome Campus, Hinxton, Cambridge, CB10 ISA, UK. 

Drummond H: Gastrointestinal Unit, Centre for Genomic and 
Experimental Medicine, Institute of Genetics and Molecular Medicine, 
University of Edinburgh, Edinburgh EH4 2XU, UK. 

Edwards C: Department of Gastroenterology, Torbay Hospital, Torbay 
TQ2 7AA, UK. 

Hart A: Inflammatory Bowel Disease Unit, St Mark's Hospital, Watford 
Road, Harrow, Middlesex HA1 3UJ, UK. 

Hawkey C: Nottingham Digestive Diseases Centre, Queen's Medical 
Centre, Nottingham NG7 1AW, UK. 

Henderson P: Royal Hospital for Sick Children, Paediatric Gastroen- 
terology and Nutrition, Glasgow G3 8SJ, UK. 

Khan M: Department of Medical Genetics, Manchester Academic 
Health Science Centre (MAHSC), University of Manchester, Manchester 
Ml 3 OJH, UK. 

Lamb CA: Department of Gastroenterology & Hepatology, University 
of Newcastle upon Tyne, Royal Victoria Infirmary, Newcastle upon Tyne 
NE1 4LP, UK. 

Lee JC: Inflammatory Bowel Disease Research Group, Addenbrooke's 
Hospital, University of Cambridge, Cambridge CB2 OQQ, UK. 

Mansfield JC: Department of Gastroenterology & Hepatology, Univer- 
sity of Newcastle upon Tyne, Royal Victoria Infirmary, Newcastle upon 
Tyne NE1 4LP, UK. 

Mathew CG: Department of Medical and Molecular Genetics, King's 
College London School of Medicine, 8th Floor Guy's Tower, Guy's 
Hospital, London, SE1 9RT, UK. 

Mowat C: Department of General Internal Medicine, Ninewells 
Hospital and Medical School, Ninewells Avenue, Dundee DDI 9SY, UK. 

Newman WG: Department of Medical Genetics, Manchester Academic 
Health Science Centre (MAHSC), University of Manchester, Manchester 
Ml 3 OJH, UK. 

Prescott NJ: Department of Medical and Molecular Genetics, King's 
College London School of Medicine, 8th Floor Guy's Tower, Guy's 
Hospital, London, SE1 9RT, UK. 

Simmons A: Translational Gastroenterology Unit, Experimental Med- 
icine Division, John Radcliffe Hospital, Headington, Oxford OX3 9DU, 
UK. 

Simpson P: Translational Gastroenterology Unit, Experimental Medi- 
cine Division, John Radcliffe Hospital, Headington, Oxford OX3 9DU, 
UK. 

Taylor K: Department of Gastroenterology, Guy's & St Thomas' NHS 
Foundation Trust, St Thomas' Hospital, London SE1 7EH, UK. 

Wilson DC: Child Life and Health, University of Edinburgh, Edinburgh 
EH9 1UW, UK. 

Author Contributions 

Conceived and designed the experiments: NK AW PL SD HFJPJS CL 
GH. Performed the experiments: NK AW SB SD FF JT. Analyzed the 
data: NK AW PL GH. Wrote the paper: NK GH. 



PLOS ONE | www.plosone.org 



8 



February 2014 | Volume 9 | Issue 2 | e88982 



Extraction Kit Impact on Fecal Bacterial Diversity 



References 

1. The Human Microbiomc Project (2012) Structure, function and diversity of the 
healthy human microbiomc. Nature 486: 207-214. doi: 10.1038/naturcl 1234. 

2. Qin J, Li R, Racs J, Arumugam M, Burgdorf KS, ct al. (2010) A human gut 
microbial gene catalogue established by metagenomic sequencing. Nature 464: 
59-65. doi:10.1038/nature08821. 

3. Whitman WB, Coleman DC, Wicbe WJ (1998) Perspective Prokaryotes: The 
unseen majority. 95: 6578-6583. 

4. Ley RE, Peterson DA, Cordon JI (2006) Ecological and evolutionary forces 
shaping microbial diversity in the human intestine. Cell 124: 837-848. 
doi:10.1016/j.ceU.2006.02.017. 

5. Sckirov I, Russell SL, Antunes LCM, Einlay BB (2010) Gut microbiota in health 
and disease. Physiol Rev 90: 859-904. doi:10.1 152/physrcv.00045.2009. 

6. Sartor RB (2006) Mechanisms of disease: pathogenesis of Crohn's disease and 
ulcerative colitis. Nat Clin Pract Gastroenterol Hepatol 3: 390-407. 
doi: 1 0. 1038/ncpgasthep0528. 

7. Manichanh C, Borruel N, Casellas E, Guarner F (2012) The gut microbiota in 
IBD. Nat Rev Gastroenterol Hepatol 9: 599-608. doi:10.1038/nrgas- 
tro.2012.152. 

8. Simrcn M, Barbara G, Flint HJ, Spiegel BMR, Spillcr RC, ct al. (2013) 
Intestinal microbiota in functional bowel disorders: a Rome foundation report. 
Gut 62: 159-176. doi:10.1 136/gutjnl-2012-302167. 

9. Musso G, Gambino R, Cassadcr M (201 1) Interactions between gut microbiota 
and host metabolism predisposing to obesity and diabetes. Annu Rev Med 62: 
36 1 -380. doi: 1 0. 1 1 46/annurcv-mcd-0 1 25 1 0- 1 75505. 

10. Zoctcndal EG, Collier CT, Koike S, Mackic RI, Gaskins HR (2004) Molecular 
ecological analysis of the gastrointestinal microbiota: a review. J Nutr 1 34: 465— 
472. 

11. Amann RI, Ludwig W, Schleifer KH (1995) Phylogenetic identification and in 
situ detection of individual microbial cells without cultivation. Microbiol Rev 59: 
143-169. 

12. Pcrsson S, dc Boer RE, Kooistra-Smid AMD, Olscn KEP (2011) Five 
commercial DNA extraction systems tested and compared on a stool sample 
collection. Diagn Microbiol Infect Dis 69: 240-244. doi:10.1016/j.diagmicro- 
bio.2010.09.023. 

13. McOrist AL, Jackson M, Bird AR (2002) A comparison of five methods for 
extraction of bacterial DNA from human faecal samples. J Microbiol Methods 
50: 131-139. doi:10.1016/S0167-7012(02)00018-0. 

14. Ncchvatal JM, Ram JL, Basson MD, Namprachan P, Niec SR, et al. (2008) 
Fecal collection, ambient preservation, and DNA extraction for PGR 
amplification of bacterial and human markers from human feces. J Microbiol 
Methods 72: 124-132. doi:10. 1016/j.mimet.2007.1 1.007. 

15. MaukonenJ, Simoes C, Saarela M (2012) The currently used commercial DNA- 
cxtraction methods give different results of clostridial and actinobacterial 
populations derived from human fecal samples. EEMS Microbiol Ecol 79: 697— 
708. doi: 10. 1111 /j. 1574-6941 .20 1 1 .0 1 257.x. 

16. Smith B, Li N, Andersen AS, Slotved HC, Krogfclt KA (2011) Optimising 
bacterial DNA extraction from faecal samples: comparison of three methods. 
Open MicrobiolJ 5: 14-17. doi:10.21 74/ 1874285801 105010014. 

17. Yu Z, Morrison M (2004) Improved extraction of PCR-quality community DNA 
from digesta and fecal samples. Biotechniqucs 36: 808-812. 

18. Zoctcndal EG, Ben-Amor K, Akkermans AD, Abee T, de Vos WM (2001) DNA 
isolation protocols affect the detection limit of PCR approaches of bacteria in 
samples from the human gastrointestinal tract. Syst Appl Microbiol 24: 405—410. 
doi:10. 1078/0723-2020-00060. 

19. Aricfdjohan MW, Savaiano DA, Nakatsu CH (2010) Comparison of DNA 
extraction kits for PCR-DGGE analysis of human intestinal microbial 
communities from fecal specimens. Nutr J 9: 23. doi:10.1 186/1475-2891-9-23. 

20. Claasscn S, du Toit E, Kaba M, Moodley C, Zar HJ, et al. (2013) A comparison 
of the efficiency of five different commercial DNA extraction kits for extraction 
of DNA from faecal samples. J Microbiol Methods 94: 103-110. dof.10.1016/ 
j.mimet.2013.05.008. 



21. Salonen A, NikkilaJ, Jalanka-Tuovinen J, Immonen O, Rajilic-Stojanovic M, ct 
al. (2010) Comparative analysis of fecal DNA extraction methods with 
phylogenetic microarray: effective recovery of bacterial and archacal DNA 
using mechanical cell lysis. J Microbiol Methods 81: 127-134. doi:10.1016/ 
j.mimet.2010.02.007. 

22. Yuan S, Cohen DB, Ravel J, Abdo Z, Forney LJ (2012) Evaluation of methods 
for the extraction and purification of DNA from the human microbiomc. PLoS 
One 7: e33865. doi:10.1371/journal.pone.003.3865. 

23. Milani C, Hevia A, Foroni E, Duranti S, Turroni F, ct al. (2013) Assessing the 
fecal microbiota: an optimized ion torrent 16S rRNA genc-based analysis 
protocol. PLoS One 8: c68739. doi:10.1371/journal.pone.0068739. 

24. Wu GD, Lewis JD, Hoffmann C, Chen Y-Y, Knight R, ct al. (2010) Sampling 
and pyrosequencing methods for characterizing bacterial communities in the 
human gut using 16S sequence tags. BMC Microbiol 10: 206. doi:10.1186/ 
1471-2180-10-206. 

25. Lauber CL, Zhou N, Gordon JI, Knight R, Ficrcr N (2010) Effect of storage 
conditions on the assessment of bacterial community structure in soil and 
human-associated samples. FEMS Microbiol Lett 307: 80-86. doi: 10. 1111/ 
j.l574-6968.2010.01965.x. 

26. Walker AW, Ince J, Duncan SH, Webster LM, Holtrop G, ct al. (2011) 
Dominant and diet-rcsponsivc groups of bacteria within the human colonic 
microbiota. ISMEJ 5: 220-230. doi:10.1038/ismej.2010. 1 18. 

27. Costcllo EK, Lauber CL, Hamady M, Ficrcr N, Gordon JI, ct al. (2009) 
Bacterial community variation in human body habitats across space and time. 
Science 326: 1694-1697. doi: 10. 1 126/sciencc'l 177486. 

28. Ficrcr N, Hamady M, Lauber CL, Knight R (2008) The influence of sex, 
handedness, and washing on the diversity of hand surface bacteria. Proc Nad 
Acad Sci USA 105: 17994-17999. doi:10.1073/pnas.0807920105. 

29. Hansen R, Russell RK, RciffC, Louis P, Mcintosh F, ct al. (2012) Microbiota of 
Dc-Novo Pediatric IBD: Increased Faecalibactenum Prausnitzii and Reduced 
Bacterial Diversity in Crohn's But Not in Ulcerative Colitis. Am J Gastroenterol: 
1-10. doi:10.1038/ajg.2012.335. 

30. Schloss PD, Wcstcott SL, Ryabin T, Hall JR, Hartmann M, ct al. (2009) 
Introducing mothur: open-source, platform-independent, community-supported 
software for describing and comparing microbial communities. Appl Environ 
Microbiol 75: 7537-7541. doi: 10.1 128/AEM.01541-09. 

31. Quince C, Lanzen A, Davenport RJ, Turnbaugh PJ (2011) Removing noise 
from pyroscquenced amplicons. BMC Bioinformatics 12: 38. doi: 1 0. 1 186/147 1- 
2105-12-38. 

32. Yue JC, Clayton MK, Lin FC (2001) A nonparametric estimator of species 
overlap. Biometrics 57: 743-749. doi:10. 1 1 1 l/j.0006-341X.2001.00743.x. 

33. Letunic I, Bork P (2011) Interactive Tree Of Life v2: online annotation and 
display of phylogenetic trees made easy. Nucleic Acids Res 39: W475-8. 
doi:10.1093/nar/gkr201. 

34. Flint HJ, Scott KP, Louis P, Duncan SH (2012) The role of the gut microbiota in 
nutrition and health. Nat Rev Gastroenterol Hepatol 9: 577-589. doi: 10. 1038/ 
nrgastro.2012.156. 

35. Eckburg PB, Bik EM, Bernstein CN, Purdom E, Dethlefsen L, et al. (2005) 
Diversity of the human intestinal microbial flora. Science 308: 1635-1638. 
doi:10.1126/scicncc. 1110591. 

36. Lawley B, Tannock GW (2012) Nucleic acid-based methods to assess the 
composition and function of the bowel microbiota. Gastroenterol Clin North 
Am 41: 855-868. doi:10.1016/j.gtc.2012.08.010. 

37. Lozuponc CA, Stombaugh J, Gonzalez A, Ackcrmann G, Wendel D, et al. 
(2013) Meta-analyses of studies of the human microbiota. Genome Res 23: 
1704-1714. doi: 1 0. 1 1 0 1 /gr. 1 5 1 803. 1 1 2. 

38. Ramirez-Farias C, Slezak K, Fuller Z, Duncan A, Holtrop G, ct al. (2009) Effect 
of inulin on the human gut microbiota: stimulation of Bifidobacterium adolescentis 
and Faecatibacterium prausnitzii- Br J Nutr 101: 541-550. doi: 10.1017/ 
S00071 14508019880. 



PLOS ONE | www.plosone.org 



9 



February 2014 | Volume 9 | Issue 2 | e88982 



